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ABSTRACT 

We discuss methods to compute significance levels for the existence of quasi-periodic oscillations (QPOs) in 
Active Galactic Nuclei (AGN) which take the red-noise character of the X-ray lightcurves of these objects into 
account. Applying epoch folding and periodogram analysis to the XMM-Newton observation of the Seyfert 
galaxy Mrk 766, a possible QPO at a timescale of 4200 s has been reported. Our computation of the significance 
of this QPO, however, shows that the 4200 s peak is not significant at the 95% level. We conclude that the 4200 s 
feature is an artifact of the red-noise process and not the result of a physical process within the Active Galactic 
Nuclei. 

Subject headings: accretion disks — galaxies: individual (Mrk 766) — galaxies: Seyferts — X-rays: galaxies 



1. INTRODUCTION 

X-ray quasi-periodic oscillations (QPOs) are among the most 
important o bservational properti es of galactic X-ray binaries 
(XRBs; see van der Klis, 2000| , for a recent review), yield- 
ing important constraints on the mass of the central black hole 
(BH), Mbh, and providing theoretical clues and constraints on 
the operative physical processes and geometry in the regime of 
strong g ravity. Althou gh their origin is still a matter of scientific 
debate ( Psaltis, 2000 , and references therein), there is general 
agreement that QPOs originate close to the central BH. 

To date, QPOs at timescales of a few kiloseconds have 
also been claimed for some active galac tic nuclei (AGN): 
NGC 4151 ( jFiore, Massaro & Perola, 1989b, NGC 6814 (M it- 
taz & Branduardi- Raymont, 1989), NGC 5548 (P apadakis 
& Lawrence, 1993), NGC 4051 (Papadakis & Lawrence , 



1996) 



1995), RX J0437.4 -4711 (Halpern & Marshall 
I RAS 18325-5926 fllwasawa et al., 1998b, MCG^6 -30-15 
(|Lee et al., 2000b, Mrk 766 ([Boiler et a l., 2001b, and 



IRAS 13224-3809 ( ffefferkorn et al„ 2001[ ). In addition, 
possible long term periodicities with periods of months have 
been claimed for the radio loud AGN Mrk 421, Mrk 501, and 
PKS 2155-304 ( |Osone, Teshima & Mase, 200l| ). Some of the 
kilosecond QPO findings, however, are controversial, with the 
strongest EXOS AT result (NGC 5548) being disputed by T agli- 
aferri et al. (1996), who attribute it mostly to periodic swapping 
of detectors. NG C 6814 turned out to be confused with a cat a- 
clysmic variable ( [Vladejski et al., 1993[ [Staubert et al., 1994 - 

Given the important implications based upon detection of 
QPO in AGN, it seems worthwhile to study the methods em- 
ployed to determine their significance in detail. In this Let- 
ter, we present a study of how to compute this significance us- 
ing Monte Carlo simulat ions of lightcurves with the method of 
Timmer & Konig (1995 ) and using periodogram analysis and 
epoch folding to detect the periodicity (§|2|). We then apply 
our methods to a reanalysis of the XMM-N ewton lightcurve of 
Mrk 766 and find that the QPO claimed in [Boiler et al. (200 1| ) 
has in fact low statistical significance ( 
results and comment on further work. 



2. SIGNIFICANCE OF QPO DETECTIONS 

Currently, two methods for the variability analysis of astro- 
nomical sources, especially in searching for periodic signals, 
are common in astronomy: periodogram analysis and epoch 
folding. We only give a brief description of these m ethods here, 
see, e .g., [van der Klis (1989b, Leahy et al. (1983) and D avies 



(1990) for in-depth discussions. Based on the Fourier decom- 
position of the lightcurve, periodogram analysis (often called 
power spectrum density analysis, PSD) is especially sensitive 
to periodic signals with a modulation that is close to sinusoidal. 
On the other hand, epoch folding (x 2 (P)) is based on compar- 
ing pulse profiles obtained from binning the data into phase bins 
at a test period, P, with a constant count rate using a x 2 test. 
In both methods, the detection of a periodic signal is claimed if 
the PSD value or the x 2 value at the period of interest is signifi- 
cantly above the values of the testing statistics surrounding this 
period. 

In order to safely use either of these methods, it is critically 
important to accurately estimate the significance of QPO fea- 
tures. This is not trivial for real AGN lightcurves, which have 
much worse sampling and signal to noise than X-ray binary 
lightcurves. Were the X-ray lightcurve purely dominated by 
white noise and evenly sampled, the significance could be eas- 
ily determined from the statistical properties of the PSD or from 
the x 2 (P) statistics. This assumption, however, does not apply 
to AGN, where the PSD can be well approximated by a power- 
law P(f) oc f~P with ~ 1 . . . 2 for the frequencies of inter- 
est ( Lawrence & Papadakis, 1993|; Green, McHardy & Lehto, 



In SMI we discuss our 



1993; [Konig & Timmer, 1997| ; [Edelson & Nandra, 1999b - 

When studying a feature in x z {P) or a PSD, we can ask two 
different questions: 1 . What is the significance of a QPO peak at 
a given (predefined) frequency (the "local significance" of the 
QPO), and, 2. what is the significance of a QPO feature seen in 
a given frequency range (the "global significance" of the QPO). 
Note that these questions are really different questions, since 
we do have more knowledge about the QPO in the case of the 
"local significance" (we do know its frequency), while in the 
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Fig 1 — Thrpp pnamplpt f typical red- noise lightcurves for a process with an / 19 spectrum binned to a resolution of 100 s using the method described 

by rimmer & Konig (19951. Upper panels: Simulated red-noise lightcurves. Lower panels: x 2 (P) curves from epoch folding (solid line) of the corresponding 



simulations. The dasl 
simulations using the 



erl lines represent the 99 % 95%, and 68% "local significance" levels (see text for definition) obtained for a sample of 5000 Monte Carlo 
rimmer & Konig (19951 method. The dotted line represents the 99% significance level obtained with the "phase randomization" method, 
which overestimates the significance of the peaks in the x 2 (P) curves. Note that the presence of one or more peaks in the x 2 



2 (P) curve is far from unusual for 



red-noise lightcurves. 



case of the "global significance", we are only interested in hav- 
ing the question answered that a feature somewhere in the inter- 
esting frequency range is significant or not. To our knowledge, 
however, no formulae for determining the significance of a peak 
in the PSD or in x 2 (P) exist for such red-noise lightcurves. A 
common approach , therefore, is to resort to Monte Carlo sim- 
ulations (see, e.g., frforne & Baliunas, 1986 ). In these simula- 
tions, a large number (> 1000) of lightcurves with the same 
statistical properties and same temporal sampling as the origi- 
nal lightcurve are generated and their \ 2 (P) is computed. 

For computing the "local significance", one determines the 
statistical distribution of the resulting x 2 (P) value at the fre- 
quency of interest. A deviation of the measured x 2 (P) from the 
red-noise x 2 (P) lS significant if it is above a certain threshold 
determined from this distributi on. Typically, threshold value s 
of 99% or even 99.9% are used (Bevington & Robinson, 1992). 



seen in a red-noise PSD is to be determined. One usually starts 
with a model PSD and performs an inverse Fourier transfor- 
mation to obtain a lightcurve. An o ften used algorithm , called 
the "phase randomization" method (Done et al., 1992), deter- 



An identical procedure can be used to test the significance of a 
peak in the PSD. 

For determining the "global significance", a similar approach 
is used. Since we are looking at x 2 (P) values determined 
at different periods, one has to take the red-noise character 
of the data and window effects into account. Instead of di- 
rectly using x 2 (P) values, more robust tests can be devised 
by first dividing the computed individual x 2 (P) values by the 
average (x 2 (P)), obtained by averaging the x 2 (P) curves of 
the simulated lightcurves. To determine the "global signifi- 
cance" of the maximum x 2 (-P)-peak of an observed lightcurve, 
we therefore propose to compare this value to the distribu- 
tion of £ max := max {x 2 (P)/ (x 2 (P))} from the simulated 
lightcurves. Here, the maximum is to be taken over the pe- 
riod range of interest. We would consider an observed peak as 
likely due to a physical effect only if it is above the 99% thresh- 
old determined from the distribution. The maximum X 2 {P)~ 
peaks present in the red-noise lightcurves of Fig. |ljb and c have 
a "global significance" of 61% and 80% respectively. Similar 
global tests can also be devised for the distribution of the 2nd or 
3rd largest x 2 (P) value and for the significance of the largest 
peaks in a PSD, although we will not use them here. 

It is crucially important how lightcurves are produced by 
Monte Carlo simulations, if the significance level of any peak 



mines the Fourier amplitude from the square root of the power- 
law shaped PSD and assumes the Fourier phase to be uniformly 
distributed in [0, 2ir[. Although the PSDs produced by "phase 
randomizati on" are oc / -/3 , it was pointed out by Timmer & 
Konig (1995) and |Papadakis & Lawrence (1995[ ) that the re- 
sulting lightcurves do not resemble the pure red-noise process 
in all of their statistical properties. First, this procedure chooses 
a deterministic amplitude for each frequency and only random- 
izes the phases. All simulated lightcurves thus exhibit a trend 
caused by the dominating lowest frequency. Secondly, the pe- 
riodogram of a red-noise lightcurve must obey the usual peri- 
odogram statistics: the PSD follows approximately a x 2 distri- 
bution with two degrees of freedom, x\, i- e -' the standard devi- 
ation of each PSD point is of the same magnitude as the PSD 
valu e itself such that the periodogram is fluctuating wildly (see, 
e.g., van der Klis, 1989). "Phase randomization" does not take 
into account this randomness of the periodogram according to 
the x\ distribution and therefore the uncertainty of the related 
distribution of the estimated periods is significantly underesti- 
mated. In other words, red-noise PSDs have a larger scatter 
than those obtained from "phase randomization" - including 
the possibility of outliers that strongly deviate from the general 
/ _/3 behavior. This is important since these outliers could be 
interpreted as quasi periodic oscillations, while in reality they 
result from the statistics of the red-noise process. We note that 
the frequent occurrence of random peaks in red-noise PSDs is 
avoided in X-ray binary work by averaging many PSD s to ob- 
tain the "t rue PSD" of a source (^Jowak et al., 1999|; v an der 
Klis, 1989). For AGN, averaging the PSDs is unfortunately 
not possible due to the prohibitively long observation times re- 
quired. 

As we have mentioned above, the "phase randomization" al- 
gorithm will not produce lightcurves with the appropriate red- 
noise statistical characteristics. Indeed, were one to use the 
Monte Carlo approach outlined above to lightcurves computed 
with "phase randomization", the QPO significance would be 



3 



overestimated. Instead, an algorithm that produces lightcurves 
with the correct statistical red-noise behavior has to be used. 



For this purpose Timmer & Konig (1995 ) proposed a new algo- 
rith m. This algorithm (used also by G reen, McHardy & Done, 
1999) allows randomness both in phase and in amplitude pro- 
ducing the desired x% dist ribution in the periodogr am. In the 
following, we will use the Timmer & Konig (1995 ) algorithm 
to simulate red-noise lightcurves. As an illustration, three sim- 
ulated examples of red-noise lightcurves with significance lev- 
els obtained from the Monte Carlo simulations are displayed in 
Fig. |l[ We also display the significance levels obtained with 
"phase randomization" to indicate that these levels are in fact 
much smaller than the correct levels and therefore would imply 
an overestimate in the significance of an apparent period. 

We note that Poisson noise introduces additional "observa- 
tional noise" in the measured lightcurves. This observational 
noise has to be added to the simulated lightcurves after the in- 
verse Fourier transform has been performed. For each time bin, 
the number of observed photons is drawn from a Poisson distri- 
bution with its mean given by r(t)At, where r(t) is the simu- 
lated count rate and At is the binning. For data with new instru- 
ments such as XMM-Newton, with a high signal to noise ratio, 
this latter step can be ignored if the source is bright enough. For 
example, with At — 100 s, "observational noise" contributes 
less than 3% to the Mrk 766 lightcurve. For earlier instruments 
such a simplification of the Monte Carlo algorithm is not pos- 
sible. We note that in earlier work employing "phase random- 
ization" and the addition of observational noise, the resulting 
PSD statistics would asymptotically approach the \\ distribu- 
tion and therefore the overestimation of the period significance 
would be less than with newer data with a high signal to noise 
ratio. 
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FIG. 2.— XMM-Newton EPIC-pn lightcurve of Mrk 766 for the 0.2-2keV 
band with a resolution of 100 s. 

3. THE CASE OF MRK 766 

We now apply the methods outlined in the previous section 
to the putative QPO in XMM-Newton data of Mrk 766 taken 
during revoluti on 0082. We concent rate on data from the EPIC- 
pn instrument ( Striider et al., 2001 ) which was operated in the 
small window mode during the observation. 

3.1. Data Extraction 

We extracted source photons from a circle of 9.5 pixels ra- 
dius centered on the source (detector coordinates (37.5,54)). 
For the background, data from an off-axis position (18,17.5) 
extracted with a circle of then same radius were used. The 
time range of t he lightcurve was c hosen to be consistent with 
the approach of Boiler et al. (2001) and results in an exposure 
time of 29 ksec. After background subtraction, we corrected the 



measured count-rates as is appropriate for the ~71% live-time 
duri ng the 5.7 ms readout cycle of the pn-CCD (K uster et al., 
1999). Fig. |2| displays the resulting lightcurve. 

3.2. Lightcurve Analysis 

We display the PSD and the x 2 ( p ) 

curves in Fig. || (the fre- 
quency range is 0.4 x 10~ 4 Hz to 30 x 10~ 4 Hz with 57 inde- 
pendent frequencies, the period range is 2000-5000 s with 132 
test periods). A peak at a period of ^4200 s (corresponding 
to a frequency of ~ 2.5 x 10~ 4 Hz) that is consistent with the 



period claimed by Boiler et al. (2001 ) is seen. In order to deter- 
mine the significance of the peaks seen in Fig. we computed 
significance levels using the methods outlined in section 

Before we can perform these simulations, we need to de- 
termine the shape of the PSD. For this p urpose, we apply the 
"response meth od" of [Done et al. (1992b and Green, McHardy 
& Done (1999), where a model power spectrum generated 
through the combination of a large number of red noise simu- 
lated lightcurves is compared to the observed power spectrum. 
Varying the slope and normalization of the model periodogram, 
the best % 2 fit gives a slope of (3 = 1.9, and a normalization 
determined by the variance of the original XMM lightcurve 
(X 2 /dof = 17.89/55). This low value indicates that the observed 
PSD is fully consistent with a power law and that no additional 
components, such as a QPO, are required. 
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FIG. 3. — " psri r pghy pt al. (1983) normalization and best power-law 
fit using the Done et all (1W2| ) method (continuous linesj^and, b x 2 (P) curve 
(continuous line) of the XMM Mrk 766 lightcurve of Fig. El The set of dashed 
lines in both panels represent in ascending order the 68%7»5%, and for 99% 
"local significance" levels for a set of 5000 Monte Carlo red-noise simulations 
with/3 = 1.9. 

We note, however, that the short duration of the observation 
only allows for a poor determination of the PSD shape. Meth- 
ods complementary to testing the consistency of the PSD with 
a power law should be used. We therefore simulated the de- 
sired 5000 lightcurves from a f- 1 9 -PSD, with 29000 s of du- 
ration using a sampling interval of 100 s, a mean count rate /i = 
9.15 counts s , a lightcurve variance a 2 — 2.46 counts 2 s -2 . 
Examples of such simulated red-noise lightcurves are shown in 
Kg. 1 
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To obtain the "local significance" of the peak at ^4200 s , 
we calculated % 2 (P) and the PSD of each of the 5000 Monte 
Carlo realizations. For each trial period, we then computed the 
distribution of the x 2 (P) an d PSD values at this trial period 
from all realizations. These distributions were used to deter- 
mine the 99%, 95%, and 68% "local significance" levels using 
the method described in section ||. Our results are shown in 
Fig. [3l The peak at ^4200 s claimed as QPO by B oiler et al. 
(2001) is below the 95% significance curve. 
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FIG. 4. — Probability distribution of the £ max values (see text for defini- 
tion) from the 5000 red-noise simulated lightcurves in the 2000-5000 s pe- 
riod range. The "global significance" indicates the probability of finding a 
lightcurve with a maximum peak with a value greater than the correspondent 
?max value. The dashed line marks the position of the £ max value of the 4200 s 
feature in Mrk 766 in the 2000-5000 s period range. The "global significance" 
of this peak corresponds to a value of 45%. 

We also applied the "global significance" test to the Mrk 766 
lightcurve for the period range from 2000 s to 5000 s. Fig. 
shows the probability distribution of £ max , which corresponds 
to the probability of finding a red-noise lightcurve with a nor- 
malized maximum peak value less than or equal to the corre- 
sponding £ max value. For Mrk 766, the putative QPO peak 
lies at the 45% mark. In other words, ^50% of the simulated 
red-noise lightcurves show peaks that are more significant than 
the peak observed in Mrk 766. We conclude that the observed 
4200 s "QPO" in Mrk 766 may be an artifact of the red-noise 
process and not the result of a physical process. 

As we mentioned above, the value of (3 determined from the 
observations is rather uncertain. However, our result is inde- 
pendent of the specific value of (3: Simulations with j3 values 
ranging from 1 to 2, i.e., over the typical (3 range seen in AGN, 



yielded similar results. 

4. CONCLUSIONS 

In this Letter we have described two methods to determine 
the significance of possible quasi-periodic signals in the red- 
noise lightcurves observed from Active Galactic Nuclei using 
Monte Carlo simulations; a frequency-dependent "local signif- 
icance" test and a "global significan ce" test. Reiterating ar- 
guments by rimmer & Konig (1995 ), we showed that "phase 
randomization" techniques should not be used for the genera- 
tion of simulations since the resulting lightcurves do not exhibit 
true red-noise characteristics. The periodograms of lightcurves 
produced by this method do not obey the \ 2 statistics which re- 
sults from a random process, overestimating therefore the sig- 
nificance of peaks present in the periodogram of an underlying 
red-noise lightcurve. Inst ead of "phase randomizat ion" we rec- 
ommend the algorithm of Timmer & Konig (1995 ) to simulate 
red-noise lightcurves with the correct statistical properties. 

The "local significance" test based on red-noise power sim- 



ulations generated with the [Timmer & Konig (1995[ ) algorithm 
shows that the 4200 s feature in Mrk 766 is not significant at 
the 95% level. This statement holds for both, PSD and x 2 {P) 
analysis (Fig. |J). The "global significance" of the 4200 s fea- 
ture is 45%, i.e., higher peaks in the 2000-5000s period range 
are found in roughly half of all simulated lightcurves. Thus, 
the presence of a peak in a red-noise \ 2 (P) or PSD is far from 
unusual. We are therefore led to the conclusion that we can- 
not confirm the claim of a ^4200 s QPO. Rather, we attribute 
the peak at 4200 s to a random occurrence which is due to the 
red-noise character of AGN data. 

We are currently in the process of checking the kilosecond 
QPO detections claimed for AGN using archival data and the 
methods outlined in this Letter. We will report on our results in 
a forthcoming publication. 
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